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Abstract. In this article we establish a large deviation prin- 
ciple for the family {z/^ : e G (0,1)} of distributions of the 
scaled stochastic processes {P_i^^^Zt}t<i, where (Zjtg[o,i] is a 
square-integrable martingale over Brownian filtration and {Pt)t>Q 
is the Ornstein-Uhlenbeck semigroup. The rate function is iden- 
tified as well in terms of the Wiener-Ito chaos decomposition of 
the terminal value Zi. The result is established by developing a 
continuity theorem for large deviations, together with two essen- 
tial tools, the hypercontractivity of the Ornstein-Uhlenbeck semi- 
group and Lyons' continuity theorem for solutions of Stratonovich 
type stochastic differential equations. 
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1 Introduction 

In 1938, H. Cramer [19] published a result on the probability of large de- 
viations in the law of large numbers for sums of independent real random 
variables, and some years later, H. Chernoff [6J proved a general Caramer's 
theorem. Cramer's paper marked the beginning of the study of large de- 
viations of distributions towards their limiting law. Schilder [35], mainly 
developed from his Ph. D. thesis, proved a beautiful theorem for large de- 
viations of Brownian motion, and thus opened the study of large deviations 
for probability measures on spaces of continuous paths. Schilder's analy- 
sis in [2S] proved fundamental in dealing with functional integrations over 
function spaces. It took some years, however, in particular in the hands of 
Azencott [1], Donsker-Varadhan [38], [39], Freidlin-Ventcel [15], [37], Stroock 
[36j . Deuschel-Stroock [8], Dembo-Zeitouni [7], Dupuis-Ellis fl] and etc. to 
turn the results of large deviations and the techniques developed to prove 
them into what nowadays we may call the theory of large deviations. 

Large deviation principles have been established for a large class of dis- 
tributions, mainly by exploiting Markov property or/and Gaussian nature of 
underlying stochastic processes, see [1], [2], [3], [7], [8], [9], [10], [13], [15] . 
[25], [31], [32] etc. for a small sample. 
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Let Wq = Co([0, 1]; R^) be the Banach space of all continuous paths in 
started at with running time [0, 1], equipped with the uniform norm 

||w|| = sup \w{t)\ Vw G Wq. 
te[o,i] 

Let H = Hq{[0, 1]; R'^) be the subspace of h E Wq such that its generalized 
derivative h G ^^^[0, 1]. if is a Hilbert space under the norm 



Hi = \l I \h{t)\'^dt Wh G H. 

Let {wt)t>o be the coordinate process on Wq 

wt{x) = x{t) Vx G Wq and t G [0, 1] 

and !F^ = a{ws : s < t} be the filtration generated by {wt)t>o- Then 
coincides with the Borel cr-algebra ;B(Wq) on Wq. The Wiener measure 
P"^ (see for example [23]), where the superscript w an attribute to Wiener 
who first constructed the law of Brownian motion as a measure on the space 
of continuous paths, is the unique probability on (Wq,S(Wq)) such that 
the coordinate process {wt) is a Brownian motion started from 0. Another, 
but equivalent, description of P^, is that P^ is the unique probability on 
(WQ,i3(WQ)) with characteristic function 

[ e^'(")P"'(rfa;) = e-^ll'llni V/ G (W^)* 



where (Wq)* is the dual space of Wq, and the natural imbedding (Wq)* 
H ^ Wq has been used. 

The Hilbert space Hq{[0, 1]; R) is called the Cameron-Martin space, and 
the probability space (Wq, J^i, P^) is called the Wiener space on R, where 
J-'i is the completion of JFj* under P^, and {J-'t)te[o,i] is the smallest a-algebra 
containing J-'^ and the events in J-'i with probability zero. {J-'t)t£[o,i] is the 
Brownian filtration. 

For each e > 0, P^ denotes the distribution of the scaled Brownian motion 
{\/ewt)Q<t<i, that is, P'^ is the probability measure on (Wq,JFi) such that 

j e^^^'^^P^idx) = [ e^'(^")p'"(dx) 

= e-m\%i v/G (Wq^)*. 
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It is obvious that, as e | 0, approaches zero (the probabihty measure with 
the support containing only one path: x{t) = for all t), at an exponential 
rate. The family of distributions, {P^ : e > 0}, satisfies the large deviation 
principle with rate function 

1(h) = I l/ol^l'W^^ ifheH, 

\ oo, otherwise. 
By a large deviation principle with rate function J, we mean that 

WlogPf (F) < - inf /(u;) (1.2) 

and 

lim^ioe log P,"'(0) >- inf J(w) (1.3) 

uieo 

for any closed subset F and open set O. See [7], [8] for further information 
about the general theory of large deviations. 

Brownian motion is a typical example among Markov processes and Gaus- 
sian processes, and is also a good example of continuous martingales. It is 
thus natural to seek for large deviation results for laws of properly scaled 
martingales. To the best knowledge of the present authors, it remains an 
open question whether a large deviation principle holds for martingales, see 
however [T7j, [18], [27] and the references therein for results on exponential 
tail estimates for martingales in discrete-time. 

This article presents a solution of this problem: we are going to establish a 
large deviation principle for square-integrable martingales over the Brownian 
filtration. 

Consider a square-integrable martingale {Yt)t^[o^i] (with initial zero) on 
(Wq, jFi, jFj, P'"), then, by the martingale representation theorem (Theorem 
3.5, page 201, [M]), (^t)te[o,i] is continuous and can be represented as an Ito 
integral against Brownian motion, i.e. 

Yt= fsdws 
Jo 

where {ft)t>o is a predictable process on (W^,J^i, J't,P'"). In particular, 
(yt)t>o is a measurable function of Brownian motion {wt)t>o- 

It is obvious that the scaling ^/e, which is correct for Brownian mo- 
tion, does not apply to an arbitrary martingale. Consider the Wiener- 
Ito chaos decomposition (see [21], [lOj) of a square-integrable martingale 
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Yf — P^'{Zi\J-'t) with mean zero, where P^{-\J-'t) is the conditional expecta- 
tion. Since Yi G L^(Wq, JF^, P'"^), so that (for simphcity, let us consider the 
case that d = 1, but our arguments equally apply to higher dimensions) 

oo „ 

^1 = ^ I fkih, • • • , tk)dwt^ ■ ■ ■ dwt^ (1.4) 

i^^i Jo<ti< -<tk<i 

where k-th term, a multiple Wiener-Ito integral, belongs to k-th Wiener 
chaos, and the integrands fk are symmetric functions in L^[0, 1]*^. Clearly 

oo „ 

= / fk{ti, ■■■ , tk)dwti ■ ■ ■ dwt^. 

According to Schilder's theorem, one simple while reasonable re-scaling for 
such martingale is multiplying k-th term in the decomposition by There- 
fore, one possible scaling for martingales should be P-iog^Yt, where {Pt)t>o 
is the Ornstein-Uhlenbeck semigroup on (Wq, ^i, P'^). This is the first place 
the Ornstein-Uhlenbeck semigroup comes into our study. 

Let us define the following mapping F : Hq{[0, 1]; -R^) Wq by 

oo „ 

= J2 • • • ' ^'^)^(^i) • • • Htk)dh ■ ■ ■ dtk. 

k=i Jo<ti< -<tk<t 

The main result of the paper is the following large deviation principle. 

Theorem 1.1 Let ^ e L^(Wo, ^i, P"') which has the Wiener-Itd's decom- 
position 

oo „ 

C = / ■^'^(^1' ■ ■ ■ ' ^k)dwt^ ■ ■ ■ dwt^ . 

k=l 0<ti<-<tk<t 

Let Yt — P^ m^t) for t e [0, 1], and he the distribution of (-P-iog7^^)t<i- 
Then {u^ : £ e (0, 1)} satisfies the large deviation principle with rate function 

I'{w) = inf {I{h) \ heH such that F{h) = w} , 

where I{h) = for heH. 

A special case of the above theorem, namely, for distributions of multiple 
Wiener-Ito 's integrals of the following form 

= / f{ti, ■■■ , tk)dwt^ ■ ■ ■ dwt^ 

Jo<ti<-<tk<t 
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has been established in M. Ledoux [25]. Nualart and etl. [31] extended to 
an even larger class of multiple Wiener-Ito's integrals on an abstract Wiener 

k 

space. These authors used the same scaling but only for single iterated 
integrals, we however believe that their methods may be modified to develop 
a large deviation principle for finite sums of multiple Wiener- Ito integrals, or 
more precisely for the laws of martingales of the following form 

k=i Jo<ti<-<tk<t 
Our study is based on the following simple observation: the scaling for 

k 

k-ih. term is itself sub-exponential for large fc, which ensures the laws of 
martingales 

= ^^^ I /fc(ii, ■ ■ ■ , tk)dwt^ ■ ■ ■ dwt,, 

k=l Jo<tx<-<tu<t 

though the sum is infinite, remain to satisfy the large deviation principle. 

In this stage we would like to describe the main steps of our proof of The- 
orem 11.11 which are necessary long and involve many technical issues. The 
first step is, of course, to approximate by a good family of martingales. 
More precisely, let ^ G L^(Wq, jFi, P"") have the decomposition 

oo 



so that 



fk{ti,--- ,tk)dwt^---dwt^^ 



oo ^ 

fc=i 



For each n we may choose an and 



fkiti,- ■ ■ ,tk)dwt^- ■ -dwt^. 

k=l "'0<ti<---<tfe<l 

such that ^„ — ^ in L^(Wq, jFi, P'"). The symmetric functions may be 
chosen such that — fk in for each as n — oo. We can require that 
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all are smooth enough with bounded derivatives (up to order 4 is enough) 
on [0, 1], and moreover, we can assume that fj! have a product form 

31,- Jfe=l 

where C"^'^.. are constants and Nn are natural numbers. Thanks to the hy- 
percontractivity of the Ornstein-Uhlenbeck semigroup (see L. Gross [20j for 
more details), the corresponding martingales Y{n)l = P'^{C,n\^t) converges 
to exponentially. 

The next step is to show that Y{ny for each n satisfies the large deviation 
principle, and to identify its rate function explicitly, which will be achieved by 
using Lyons' continuity theorem ([2B], see also [22], and excellent recent books 
[TB] . [5U] etc.), Schilder's large deviation principle in p- variation distance (see 
[26]) together with a simple application of Varadhan's contraction principle. 
More precisely we demonstrate that, for each n, Y{ny may be realized (or 
more precisely lifted) as a continuous function on the space of geometric 
rough paths, with respect to a variation distance. This is the precise version 
of what belonging to the folklore that Startonovich's integrals are continuous 
functions of Brownian motion paths. However we should emphasize that the 
continuity here must be understood in terms of Lyons' p-variation distance, 
rather than the uniform norm, see Proposition 14.61 below for a more precise 
statement. 

Nevertheless, it turns out that the rate function governing the large de- 
viations of {Y{ny : e G (0, 1)} is given by 

In{w) = M{I{h) : FM = w} 

where 

N„ „ 

Fn{h)t = 5^ / fkih, ■ ■ ■ , tk)h{ti) ■ ■ ■ h{tk)dwt, ■ ■ ■ dwt,. 

It is easy to see that F„ converges uniformly on any level set of / with 
respect to the uniform norm, but in the uniform norm F„ is not continu- 
ous from H C Wq to Wq. Indeed there is no continuous extension of F„ 
to the whole space Wq in general. On the other hand, we may lift the 
mappings F„ to the space of rough paths, that is, F„ is continuous in the 
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p-variation distance, but then in general F„ does not converge with respect 
to the p-variation metric as we do not have control over the derivatives of 
the integrands (fc = 1, 2, ■ ■ ■ ). Therefore, the existed (extended or gener- 
alized) contraction principles, which require that F„ are continuous and Fn 
converges uniformly on level sets of /, do not apply to the present case to 
deduce a large deviation principle for {V^ : £ G (0, 1)}. 

The main technical tool established in Section 2, a continuity theorem for 
large deviations which we believe has independent interest by its own, how- 
ever, allows us to prove the large deviation principle for {Y^ : £ G (0, 1)}. In 
Section 3, we show that the hypercontractivity of the Ornstein-Uhlenbeck op- 
erator allows us to establish the exponential tightness of the family of scaled 
martingales, which is one of the main ingredients in our proof of the main 
result. In Section 4, we construct the Ito-Lyons mappings associated with 
multiple Wiener-Ito integrals, which thus makes another key step towards 
the proof of Theorem ll.li Finally in Section 5, we collect all technical esti- 
mates together to establish a large deviation principle for square-integrable 
martingales. 

2 A continuity theorem for large deviations 

An important method in the theory of large deviations is the contraction 
principle, formulated by S. R. S. Varadhan [39]. Suppose {Z'^ : e G (0,1)} 
is a family of random variables in a Polish space E which satisfies the large 
deviation principle with a good rate function /, and suppose F : E ^ E' is a. 
continuous mapping, where E' is another Polish space, then {X^ : £ G (0, 1)}, 
where = FlZ"^), also satisfies the large deviation principle with rate 
function 

I'{s') = inf {/(s) -.seE such that F{s) = s' } 
for any s' & E'. 

However, in stochastic analysis, we often deal with Wiener functionals, for 
example, strong solutions to stochastic differential equations, which are only 
measurable rather than continuous, the above contraction principle is not 
sufficient in applications. Different generalizations of the contraction princi- 
ple, proposed by various authors over the past years, have been successfully 
applied to distributions of many interesting Wiener functionals. Among these 
generalizations, a typical one may be formulated as the following (see Theo- 
rem 4.2.23 in JJ\). Suppose Fn : E E' is a family of continuous mappings, 
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and {X^ : e G (0,1)} is a family of random variables in E' on (f2,jF, P), 
such that the continuous images Fn{Z'^) approaches in probability at an 
exponential rate 

TEr£logP{p'(F„(Z^),X^)} = -oo (2.1) 

n— »oo 

where p and p' are the distance functions on E and E' respectively. In 
addition, if F„ converges uniformly on any level set = {s : J(s) < L}, its 
limit is denoted by F (note that F is only well defined on the effective set 
H = {s : I{s) < oo}, but F is continuous on H G E), then the distributions 
of {X^ : £ G (0, 1)} obeys the large deviation principle with rate function 

I'{s') = inf {/(s) -.seH such that F{s) = s' } (2.2) 

for any s' E E' . 

In general we are interested in the following question. Suppose {X^ : e G 
(0, 1)} is a sequence of families of random variables in P on a probability 
space {VL,T,P) which converges to {X^ : e G (0, 1)} exponentially 

Th^£logP{p(X^,X^)} = -oo. (2.3) 

Suppose for each n, {X^ : e G (0, 1)} satisfies a large deviation principle 
with rate function /„. Then, according to Theorem 4.2.16, page 131, [7], the 
limiting distribution /i^ of X^ satisfies a weak large deviation principle with 
rate function 

/oo(s') = sup lim inf inf /„(s) (2.4) 

where P(sq,(5) is open ball centered at Sq with radius 5. Furthermore, if in 
addition I^o is a good rate function and for any closed set S one has 

inf /oo(s') < lim sup inf /„(s) (2.5) 

then : £ G (0, 1)} satisfies the large deviation principle with rate function 

However, in general, one can not deduce that a large deviation principle 
for limit processes {X^ : e E (0, 1)} under (12. 3p alone. 

In many applications, the rate functions /„ are often given as the images of 
a common good rate function / under some mappings Fn : H = {I < oo} 
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E. If Fn are not continuous with respect to the distance on E, then the right- 
hand side of fl2.4l) is difficult to compute, and it is hard then to verify the 
condition fl2.5p . The main goal of this section is to provide useful sufficient 
conditions in this situation, such that the limiting processes {X"^ : e E (0, 1)} 
satisfies a large deviation principle. To this end, we introduce a concept of 
rate-function mappings, see Definition l2.1l below. which are mappings sending 
a good rate function to another one. 

More precisely, we handle the following situation. Suppose X^, X^ are 
random variables in E, X^ converges to X^ exponentially, i.e. (12. 3p is sat- 
isfied, and for each n, X^ satisfies the large deviation principle with rate 
function given by 

W) = inf{/(s) : = s'} 

where / is a good rate function, and Fn are rate-function mappings, so that 
each In is a good rate function. In many interesting cases, Fn are not contin- 
uous in the topology on E. To ensure a large deviation principle of limiting 
processes to hold, the main conditions we impose on the family of rate- 
function mappings {Fn} are the followings. F„ converges uniformly on any 
level set {s : I{s) < L}, and all F„ are weakly continuous in a proper topol- 
ogy on the effective set H = {s : I{s) < oo}. For more details, see Theorem 
[23] below. 

2.1 Rate-function mappings 

Let E he a separable Banach space with its norm denoted by || ■ ||. The 
induced distance function is denoted by p, that is, p{s,s') = ||s — s'||. Let / 
be a good rate function on E, that is, / : — >■ [0, oo] such that for each real 
number L > 0, its level set = {s : I{s) < L} is compact in E. 

Let H = {s : I{s) < oo} be the effective set of the definition of J. We 
assume that 

1. if is a dense vector subspace of E, and there is a Hilbert norm || ■ \ \h 
on H, such that {H, || ■ ||//) is a Hilbert space. 

2. For each real L > 0, Kl is weakly compact, bounded and closed in 

{hA\-\\h). 

It is necessary that for all s E H, \ \s\ \ < C\\s\\h for some constant C. 
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The aim of this part is to study a class of mappings F : H ^ E so that 
If{s') = inf{/(s) -.seH such that F{s) = s'} (2.6) 
is again a good rate function on E. 

Definition 2.1 A mapping F : H E is a called a rate-function mapping, 
if the following conditions are satisfied. 

1) F is continuous with respect to the corresponding norms, i.e. 

^0 as\\s-s\\H^Q- 

Note that F may he not continuous as a mapping H G E to E with respect 
to the norm \ \ ■ \ \. 

2) F : H E is weakly continuous on any in the following sense: 
if Sk s weakly in H, where Sk G Kl (so that s G as well), then 
-^(sfe) — > F{s) weakly in E. 

3) For any L >0, the range F{Kl) = {s' : s e Kl such that F{s) = s'} 
is compact in {E, \ \ ■ ||). 

The following is the main use of the concept of rate-function mappings. 

Proposition 2.2 If F : H ^ E is a rate-function mapping, then Ip defined 
by Ii2.6\) is a good rate function on E. 

Proof. Let K'^ = {s' E E : If{s') < L}. We have to show that K'^ 
is a compact subset of E. To this end, choose any sequence {s^} in K'j^, 
such that If{s'^) < L. Then there are Sn E H such that = and 

H^n) < + ^- In particular Sn € -R'l+i, and C F{Kl+i) which is 

compact in E. Therefore we may assume that s'„ Sq in E, otherwise 
consider a convergent subsequence instead. Since {s„} C Kl+i so that we 
can extract a subsequence s„j. sq weakly in H as well as Sn^. — > sq in E. 
Since F is weakly continuous on K^+i, so that F{snJ — > -^(-So) weakly in E. 
Therefore we must have F{so) = s'q and, since / is a good rate function, so 
that 

If{s'o) < I{so) < lim„^oo/(s„) = L 

which implies that s'q G K'^^. Therefore K'^^ is compact. ■ 

The following proposition shows that the set of rate-function mappings 
is closed under the uniform convergence on level sets of /. 
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Proposition 2.3 Let Fn : H E be a sequence of rate-function mappings. 
Suppose that F„ converges uniformly on Ki for any L > 0, and let F denote 
the limiting function. Then F is also a rate-function mapping. 

Proof. As the uniform limit, F is clearly continuous from {H, \ \ ■ \ 
to {E, II ■ II). To show that F is weakly continuous on K^, consider any 
Sk, s G Kl, Sk —>■ s weakly in H. Since Fn —>■ F uniformly on Ki, for every 
e > 0, there is an A^i such that 

||F„(s)-F(s)||<| W>N,^seKL. (2.7) 

Let E*. Then 

\{^,Fisk)-F{s))\ < |(e,F(s,)-F„(sfc))| + |(e,F„(s)-F(s))| 

+ |(e,F„(sfc)-F„(s))| 

< f llelU' + l(e,i^n(..)-Fn(.))| 

fllelU* Vn>iVi 

as /c ^ oo, so that F is weakly continuous on K^. 

Next we prove that F{Kl) is compact in E. Consider any sequence 
{s'^} C F{Kl), so that F{sk) = s'j^ for some Sk G K^. For any e > 0, there is 
an A^^i such that fl2.7p holds. Hence 

\\F{Sk) - F{si)\\ < y + WEN^Sk) - FnM)\\ Vfc,/ . 

Since Fn^{Kl) is compact, we may assume that {Fj^^{sk)} is convergent, so 
that there is an N2 such that 

\\FNAsk)-FNM)\\<l ^k,l>N2 

and therefore 

\\F{sk)-F{si)\\<e VA;,/ >iVi ViVs. 

Hence F{sk) ^ s' in E for some s' G E. We need to show that s' G F{Kl). 

Since Kl is a compact subset of E and is weakly compact in if, we may 
assume that p{sk, Sq) for some Sq G Kl, and Sk — > sq weakly in H as 
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well, otherwise considering a convergent subsequence instead. By fl2.7p we 
have 

\\Fn{sk)-F{sk)\\<^ \/n>Ni 
so that, for any C, ^ E* 

|(e,F„(s,)-F(s,))|<|||e|U. Vn>iVi. 

Letting k ^ oo, then (^,F„(so)) and /(sfe) s' so that 

|(e,F„(so)-s')l<|llelU* Vn>iVi. 

Letting ^ oo in the above inequality, to obtain 

mF{s,)~s')\<'-m\E* 

for any e > and ^ & E*. Therefore we must have F{so) = s', so that 
s'eF{KL). m 

We end this sub-section by showing some examples of rate-function map- 
pings. 

Proposition 2.4 Let E = Co([0, 1]; i?^) be the Banach space of all con- 
tinuous paths in starting from zero, endowed with the uniform norm 
\\s\\ = supjg[Q;^] 1-5(^)1; and H be the subspace of all paths s E E which 
has a generalized derivative s G L^[0, 1], together with the Hilbert norm 

\\s\\h = \J Jq \s{t)\'^dt. Then H is a Hilbert space which is dense in E. Let 

/(s) = if s & H , otherwise I{s) = oo. Then I is a good rate function 

on E with effective set H . 

Let fn G L'^[0, 1]" be symmetric functions (n = 1,2, ■ ■ ■ ) such that 

OD ^ 

$^-|l/n|li2[o,l]n < OO. (2.8) 

n. 

n=l 

Define Fn and E : H E by 

N 

FN{h)t = J2 / /"(^l' ■ ■ ■ ' tnMti) ■ ■ ■ h{tn)dti ■■■dtn (2.9) 
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and 



oo „ 

mt = E / 

„=1 >^0<ii 



fnih, ■ ■ ■ , tn)h{h) ■ ■ ■ h{tn)dh ' ' ' dtn (2.10) 



<---<tn<i 



fortE [0,1], respectively. Then 
1 ) For any L > 0, F/v converges to F uniformly on in {E, 



IS 



sup ||Fjv(/i) - ^ as N oo. 



. That 
(2.11) 



2) All Fn, F are rate-function mappings. 

Proof. Let Kl = {h E E : I{h) < L} be the level set of the rate function 
/. First of all, we note that for each L > 0, Kl is a closed ball in H, and 
therefore Kl is not only compact in E (by the Sobolev imbedding), Kl is 
also convex and bounded in H, so that Kl is weakly compact in H, according 
to Milman's theorem and Theorem 1, page 126, 

It is easy to see that each F^ is continuous from {H, || ■ Hj^) to {E, || ■ ||). 
Let us prove that F^ F uniformly on any Kl- Let h G Kl- Then, by the 
Cauchy-Schwartz inequality 



E 

n=N+l 



fnik 



tn)h{ti) ■ ■ ■ h{tn)dti ■ ■ ■ dtr 



0<ti<---<t„<t 



< 



\ 



X 



E 



fn(tl 



tn^dti ■■ - dtn 



n=N+l >^0<ti<--<t„<t 



\ ^ J 



\h{ti)---h{tnWdti---dtn 



\ 



oo 

E All/' 



'"IIl2[o,i]'^ 



n=N+l 



oo 



■N+l 



< 



\ 



E 

n=N+l 



(2L)' 



E 



\h[o,iY 



--N+1 



where we have used the fact that, if (7 is a symmetric function on [0, t]^, then 
/ g{ti, ■ ■ ■ ,tk)dti ■ ■■dtk = ^ 



giti, ■ ■ ■ ,tk)dti ■■■dtk 
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as long as g is integrable. 
Therefore 

sup sup \F{h)t — Fi^{h)t\ -^0 as N ^ oo 

heK^ t<i 

which proves our claim. 

Let us prove that F/v is weakly continuous on Kl as stated in the lemma. 
To show the weak continuity of F^, we only need to show the weak continuity 
of Fjv = ^ which has a simple form, namely 

*(/i)t= / f{tl,--- ,tn)h{h)---h{tn)dh---dtn 

J0<tl<-<tn<t 

where / e L^[0, 1]" which is symmetric, and has the following form 

m 

fit,,.-- ,tn)^ E c'"'"''"fjM---fM 

Jl:'" i3n 

where fj^ G L^[0, 1]. Let ^ h weakly in H, where £ (so that 
h e Kl). Since / is symmetric, we thus have 







= ^ / fih,--- 


,tn) (jT'kitl) - - -1 


^! J[o,t]» 




^ m 

— _ (jjl,-,jn 








xf fniti)-- 




J[0,t]" 




m 

— (jjl,-,jn 


n-1 

X](l[0,«]/ii,^fc) 




(=1 


X(l[0,t]/j„_^_i,/l) 


• • • (l[0,t]/jn>^) 



where (/, h) = f{t)h{t)dt, which yields that 
1 —1 

jl,— ,jn 

X \ {l[o,t]fM_^,hk- h)\. 
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Therefore \'^{hk)t — '^{h)t\ — * for any t G [0, 1], as ^ h weakly in H, 
and {^(/ife)} is bounded uniformly as {hk} C Kl, so that ^(/ijt) ^(^) 
weakly in E. 

Therefore all F are weakly continuous. 

For any h e Kl and [s, t] C [0, 1], we set 



Af,,] = [o,r\[o,^r. 



Then 



F{h)t - F{h)s = 5" -r / /n(tl, • • • , tn)h{h) ■ ■ ■ h{tn)dh ■■■dtn 

n\ J An 

n=l [s,t\ 



SO that, by utilizing the Cauchy-Schwarz inequality 



\ n=l ^' -^^U \ n=l ^' 

I °° I } 

< y"-r / |/n(^i,--- ,^n)P^^^l•••<■ 
\ n=l ls,t] 



\2n 

\hi 



Since 



°° 1 /■ 

V-r / \Utir-- .tn)\^dh---dtr, 

n=l •^'^[s,t] 

°° 1 /" 



n=l 
< OO 



and for each n, according to the Lebesgue theorem 

/ |/n(^i, • • • , ^n) fdtx • • • di„ ^ as s t ^, 

[s,t] 

therefore we can conclude that the functions in F{Kl) are equi-continuous 
on [0, 1], and are bounded in E: 



OO ^ 

\ n=l 
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Therefore, according to Ascoli-Arzela's theorem (page 85, Section III-3, |41j). 
F{Kl) is pre-compact. We now need to show that F{Kl) is closed in E. Let 
{wn} be any sequence in F{Kl) which converges to w in E. Let hn G Kl 
such that F{hn) = Wn- Kl is weakly compact in H, so let us assume that 
hn —>■ h weakly in H, otherwise consider a weakly convergent subsequence 
instead. Since Ki is a closed and convex subset of H, so that h G K^. 
Therefore F{hn) = Wn weakly converges to F{h) in E. We thus must have 
F{h) = w, so that w G F{Kl), and F{Kl) is compact. ■ 

Of course, similar results hold in higher dimensions, where E = Co([0, 1]; R^), 
H = H^{[0, 1]; R"^) and the rate function I{h) = ^\\h\\l,. 

2.2 Continuity of large deviations 

We have thus developed necessary tools to formulate a continuity theorem 
for large deviation principles. 

Theorem 2.5 Suppose H (Z E and I is a good rate function satisfying the 
two conditions listed at the beginning of the last subsection 2.1. Let F„ : if — 
E be a sequence of rate-function mappings, and suppose that Fn converges 
to F uniformly on any level set = {s : I{s) < L}. For each n, let 
{X^ : e G (0,1)} (as well as {X^ : e G (0, l)}j be a family of random 
variables valued in E on a complete probability space {Q,J^,P). Suppose the 
following conditions are satisfied. 

1) {X^ : e G (0, 1)} converges to {X^ : e G (0, 1)} exponentially: for any 
(5 > 

lim£logP{p(X^,X") >(5} = -oo. (2.12) 

n^oo 

2) For each n, {X^ : e G (0, 1)} satisfies the large deviation principle with 
rate function Ip^ . 

Then, the distribution family {/ig : e G (0,1)} of the limiting process 
{X^ -.EE (0, 1)} satisfies the large deviation principle with rate function Ip- 

Proof. For simplicity, we use to denote Ip^ and /' for Ip. Let p be 
the distance function on E, i.e. 

p(s,s') = ||s - s'll Vs,s' G E, 

and for Sg G E and 5 > 0, B{s'(^,6) denote the open ball in E centered at Sq 
with radius 6. 
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Firstly we show the lower bound. Let O be an open subset of E, we need 
to prove that 

lim,|^£ log uAO) > - inf I'{s) . (2.13) 
It is easy to see that we only need to show 

hm,^,s\og^x,{B(s'„5)) > - ^ inf I'(s'). (2.14) 

for any s'q G O and 6 > such that -B(so, S) C O. We may assume that 
inf 5/^5(5/^ ,5/2) -^'("S') < 00, otherwise there is nothing to prove. By the triangle 
inequality one has for any A > 

^|p(^n,4)<^l < P{P(X^,X^)>A} 



3 



+p<|p(x^4)<A + ^ 



it follows that 



logP<ip(X^,4)<y 



< log 2 + log |p {p(X^, X^) > A} V P |p(X^ 4) < A + y 1 1 

< log 2 + log P {p(X^, X^) > A} V log P |p(X^ 4) < A + ^ j . 



Therefore 



£iogp<!p(x^,4)<y 



< £log2 

+ max|£logP{p(X^,X^) > A};£logp|p(X^4) < A+l^ 



hence 



2(5 

jim lim^^o^logP <{ p{X^, Sq) < — 



< max <{ jim lim^^o^ log P {pK: > ^} ; ]^io^ log ^ i 4) < + 

25" 



li^l,^o^logP<jp(X^4) < A+ g 



for any A > 0, we have used the assumption that 

hm ljm^:QslogP{p{X'^,X') > A} = 0. 

n-+oo 

On the other hand, as {X^ : e e (0, 1)} satisfies the large deviation principle 
with rate function 7^, so that 



and therefore 



lim^.o^ log P { p{Xl 4) < ^ !> > - inf /;(.') 

3 I s'ei3(4,f ) 



lim,^o^logp|p(X^4) < A + ^1 > - hm inf I'^{s') (2.15) 
for any A > 0. 

According to the assumption that inf^,g^(^^_|) — M < oo. Since I' 

is a good rate function, there is an s[ e B{s'o, ^5), such that r{s[) = M. 
Since 

I'{s') = inf{/(s) \ seH such that F{s) = s'} 

and since / is a good rate function on E, there is an Si G i7 such that I{si) — 
M and F(si) = s\. Let ^ = F„(si) G Then lim„_oo4 = -f'l^i) = «i> so 
that for every a > there exists an Nq such that 

t'^eB(s[,a) yn>No. 

Then for n > Nq we have 

, inf < lUO 

= inf{/(s) : s e // and F„(s) = t'^} 



Choose a — -^5. Then 



inf < I{s,) = M 

3 



inf J'(s') 
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so that 

lim inf I'^{s') < inf J'(s') Vn > No 

Hence 



lim,^o^logp(p(X^s'o)<A + ^l>- inf J'(s') (2.16) 

for any A > 0, which imphes f l2.14p . 

Now prove the upper bound: for any closed set 5" in E, 

h^eioe^ogfieiS) < - inf I' is'). (2.17) 

For any 5 > 0, set 

= {s' eE \ s" e S s.t. p(s', s") < 5}. 

Then 

P{X' eS}<P{X'^eS'] + P {p(X^ X^) > 5} 

and therefore 

log P {X' G S} < log 2 + log [P {X'^eS^]y P {p(X^ x^) > 6}] 

so that, for any 5 > 

in^,io£logP{X= G S} 
< [IhK,io^logP{X^ G 5^} V^E^,io^logP{p(X^X^) > 5}] . 

For any i^' > there is an N2 depending only on 5 and such that 

h^^^oe logP {p(X^ X^) > 5} < -/^ > X2 . 

On the other hand, {X^ : e G (0, 1)} satisfies the large deviation principle 
with rate function J^, so that 

Ih^,io£logP{X^ G S} 

< (-K)vIh^^,io^logP{X^G^^} 

< max I - inf /;, -K^ Wn > N2. 
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It follows that 



lim,^o£ log P{X' eS}<- lim lim„_^ mf I'„ . (2.18) 

(5J,0 gs 



Let us consider 



If / = (X), then 



/ = limlim„_,^inf /; . 

5J.0 gs 



lim,^oe\ogP{X' e S} = -oo 

< - inf I'(s') 

s'es 

so let us assume that I < oo. In this case we show that 

inf/'</ = limlim„^^infJ^ (2.19) 

F 510 gs 

In this case, by definition of the multiple limits of the right-hand side of 
(I2.19P we may choose a sequence (sm) C Ki+i, and a subsequence rim — ^ oo 
such that ^ 

m 

and ^ 

/(S^m) < / + — . 

m 

Then {s^} C K^+i. Since -ft'j+i is compact in E, and weakly compact in H, 
we can further assume that — ^ s in Ki+i (in the distance p), and — >■ s 
weakly in H. Since / is lower semi-continuous, I{s) < I. 
For any a > 0, there is a number such that 

p(F„„(s),F(s))<- V.GiTm (2.20) 

for any m > N^. In particular 

CK 

p{s'^,F{sm))<- Vm>iV3. 

However {F(sm) : m = 1, 2, ■ ■ ■ } C F{Ki^i) which is compact in E. There- 
fore, if necessary by extracting a subsequence, we may assume {F{sm)} con- 
verges in E to s'. Hence, there is an A^4 such that 

p{F{sm),s)<- Wm>N^. 
21 



Therefore 

p{s'^, s') <a Vm > ATg V A^4 • 

That is, s'^ s' in E, so that s' G F. On the other hand, Sm ^ s weakly 
in H, so that, as F is weakly continuous on Ki^i, F{sm) F{s) weakly in 
E. We thus must have F{s) = s' G -F and I{s) < I. Therefore inipl' < I 
which completes the proof of (12 .191) . ■ 

3 Hypercontractivity and martingales 

Let us retain the notations we have established in Introduction. In particular, 
(Wq, J-'i, P^) is the Wiener space on W^. However, for simplicity, we may 
assume that d = 1 without loss of generality. 
If / G L2[0,1]" we use 

Jn{f)t = / fih, ■ ■ ■ , tn)dwt^ ■ ■ ■ dwt„ 

J0<tl<---<tn<t 

to denote the multiple Wiener-Ito integral on [0,t], t G [0, 1]. {Jn(/)t} is a 
square- integrable martingale up to time 1. According to Wiener-Ito's chaos 
decomposition (121], HQ]), if ^ G L\W^, Ti, P"^), then 

OD 

n=l 

for a sequence of symmetric functions /„ G L'^[0, 1]" and 

ll'C - E^Wl = ^ ^II/«.|Il2[o,i]" 

n=l 

where denotes the L^-norm of C,- The Ornstein-Uhlenbeck semigroup 
{Pt)t>o is defined by 

oo 
n=l 

{Pt)t>o is a symmetric diffusion semigroup on L^(Wq, jFi, P*"), which may 
be extended uniquely to a strongly continuous semigroup on L^'(Wq, jFi, P"^) 
for every p > 1. 

The following hypercontractivity of the Ornstein-Uhlenbeck semigroup 
plays a major rule in this paper. 
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Theorem 3.1 (L. Gross, Nelson J2U]) The Ornstein-Uhlenbeck semigroup 
{Pt)t>o possesses the hypercontractivity 

\\m\p(t)<\mp 

for all ^ e L^iW^, Tx, P""), p > I and t > 0, where p{t) = 1 + {p - l)e^\ 

As an application of the hypercontractivity, we present a proof of the 
following estimate, a well-known result in Gaussian analysis, which shows 
that tail behaviors of multiple Wiener-Ito integrals. 

Proposition 3.2 Let ^ = In{f)iwhere f E L'^{[0, 1]"") for some n. Let Yt = 
P"" {.i\^t) for t e [0, 1]. Then for any a < n/(2e) 



Eexp a 



1 



sup Yt 



2 t<l 



2/n^ 



< c 



(3.1) 



where 



„ 2e 1 f2ae 



Therefore, for any 5 > 



sup 



t<i 



>5\< Ca,n exp < -a 



(3.2) 



Proof. Without losing generality, we may assume that 1 1^| I2 = 1- Accord- 
ing to the hypercontractivity of (Pt), Pt^ = e""*.^ G L^*-*-* where p{t) = l-|-e^*, 
and 

l/(l+e2*) 



< e 



nt 



Vt > 



that is for any p > 1, ^ G ^^(W;^;, J^i, P^), and 

£^|er < (p-1)"''^' Vp>l. 
By Doob's inequality, supt<i G Lp(W^, J^i, P"') and 

p 



E 



sup Ft 

i<l 



< 



P — 1 



< epijp — 1) 



(p-1) 



np/2 
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for any p > 1. Since 



E exp a 



sup Yt 



t<i 



fc=0 



snpYt 



t<i 

k 



ek 



< 



ke<2 

1 + E 



sup Yt 



ke k 



ke<2 



a 
IT 









- 


sup Ft 


1 




t<i 





ke>2 



a'' ke 

Idke-i 



{ke - 1 



ke>2 
ek/2 



,nke/2 



sup Yt 

t<i 



ke 



ke<2 



ke>2 



ke 

Tike - 1 



choosing e = 2/n, we thus have 







2/n\ 




sup Ft 






t<T 





~ ^ A;! ^ A;! V 

k<n ke>2 ^ 



According to Stirhng's formula 

k'' 



< 



1 e'^ 



< 



27T^/k e i2fe+i 
1 e'^ 



(see page 52, W. Feller [H]) which follows that 

< 1 + 4e" + 







2/n\ 


a 


sup Ft 






t<T 





E:i( 



2ae 



2^ .7^2 V ^ 



the right-hand is finite if a < n/(2e). ■ 

Proposition 3.3 If^ G [W^,J^i,P'") andYt = P"" {i\J^t), then for every 
e G (0, 1) and 5 > 



sup (P_ log ^Ft) 



t<i 



(3.3) 
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Proof. By the previous lemma, -P-iog^^^ G L^^^ (yV^, J^i, P'^^ for any 
e G (0, 1), thus, by Doob's L^-inequahty 



E 



sup {P^ log ^Yt) 

t<T 



< il + e)'+^E\P_io^^^\ 



the second inequahty follows from the hypercontractivity of the Ornstein- 
Uhlenbeck semigroup {Pt)t>o- Therefore 



P^ 



sup {P- log v^Yt) 

t<T 



> 6^ < 



1 



-E 



sup {P- log ^Yt) 



t<T 



1+i 



< {l + e 



4 Ito's mappings defined by Ito's multiple in- 
tegrals 

The large deviation principle for multiple Wiener-Ito integrals has been es- 
tablished in M. Ledoux [25], also in ^31|. We believe their arguments, with a 
little bit of extra work, can equally apply to the case of finite sum of multiple 
Ito's integrals. For completeness we however include a different proof, which 
we believe has independent interest by its own. 

Our approach is to apply the contraction principle to the Ito-Lyons map- 
pings on spaces of geometric rough paths. Not like the original Ito's map- 
pings defined by solving stochastic differential equations via Ito's calculus, 
Ito-Lyons mappings will serve the same aim as that of Ito mappings, but in 
addition they are continuous with respect to variation distances. The main 
concept and the continuity result were established in an important work by 
T. Lyons [2S] (see also pD], the excellent recent books [30], [H] etc), which 
says solutions to Stratonovich type stochastic differential equations are con- 
tinuous functions of Brownian motion paths together with its Levy area. A 
more precise statement, see items 1 and 2 in Theorem 14. II below. 

Lyons' continuity theorem, or called the universal limit theorem as sug- 
gested by Malliavin, has been finding many applications in analyzing Wiener 
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functionals, for example, see the recent articles by Hambly and Lyons [2T], 
Cass and Friz ^ and etc. The important fact here is that, the rough path 
analysis, as developed in [29], allows us more effectively to apply classical 
functional analytic techniques to stochastic analysis. The result in this sec- 
tion is another example of the power of this new analysis. 



4.1 Schilder's theorem in the variation topology 

In M. Ledoux, Z. Qian and T. Zhang [26j, a version of the large deviation 
principle of Schilder's in the p-variation topology has been established, with 
which we will prove the large deviation principle for martingales. 

Let p G (2, 3) be a fixed constant. Let W be the space of all continuous 
path w G Wq which has finite total variations over [0, 1]: 



sup 

D 



J2H,-Wt,_, \ < +00 



where D runs over all finite partitions {0 = to < < • • ■ < = 1} of the 
interval [0, 1]. For a path G W we may consider its increment wl ^ = Wt—Wg 
and its Levy area 



lUgj- = / dwt^ ® dwt2 

Js<ti<t2<t 

defined via Riemann sum limits, w"^ can be considered as a d x d matrix- 
valued function on A = {(s, t) : < s < t < 1}. Then define 

w,,t = {l,wl^,wlt) if {s,t) G A 

and w : (s, t) G A — j which is called the rough path associated to 
w G W, a path of finite variations. The space of all such rough paths is 
denoted by W°° (and we may thus identify W with its "lift" W°°), equipped 
with a natural metric dp (called the p- variation metric where p G (2, 3)) 



dp{w, y) = sup ( 5^ - yl-iA I +SUP I l^ti A - yl-iM ^"''^ 



D \ , / D 



(4.1) 

Since any w G i^Q ([0, 1]; i?'^) has a finite variation on [0,1], therefore the 
Cameron-Martin space ifg ([0, 1]; -R*^) may be considered as a subspace of 
W°^, hence of to be introduced later on. 



26 



The completion of W°° under the p- variation metric dp is denoted by W^. 
T. Lyons [2B] has estabhshed the following result. Consider the ordinary 
differential equation 

d 

dyl = f^{t, yt)dt + yt) ° dwl yo = (4.2) 

i=i 

i = 1, ■ ■ ■ ,m, where we have used odwl to denote the usual differential if 
w is different iable, to indicate ( 14. 2p should be understood as Stratonovich 
stochastic differential equations for Brownian motion w. Bot interpretation 
of (14 ■2p within the setting of rough path analysis. 

If /j, /q are functions, then w — y is continuous map from W°° into 
W°° under p-variation metric dp and therefore extended continuously to be 
a map from into W^, called the Ito- Lyons mapping determined by (14. 2p . 
This result, together with the following theorem proved in [22] and Ledoux, 
Qian and Zhang [22], can be used to estabhsh large deviation principles for 
a large class of Ito's functionals. 

Theorem 4.1 Let p G (2,3) be a fixed a constant. Let (Wq, jFi,P"') be the 
d- dimensional Wiener space, so that its coordinate process {wt)t£[o,i] is an 
R^-valued Brownian motion. Let 2 < p < 3 be a fixed constant. Set 

wit = Wt-Ws 

and 

wi t = / °dwt^ ® odwt2 

Js<tl<t2<t 

where od denotes the Stratonovich integration. Let w^^t = (1, ti?^ J. The 
law of {^s,t '■ {s.t) G A} is denoted by which is a probability measure on 

1. For any w G W there is a unique solution y of ^4.2^ which belongs to 
W, denoted by G{w). Their corresponding geometric rough paths are 
denoted by w E W°° and G'(w) G W°°. The mapping G : w ^ F{w) 
can be uniquely extended to be a continuous mapping from (W^, dp) to 
(WP,dp), denoted again by G, called the Ito-Lyons mapping defined by 
^.S^ . Moreover, the projection to the first level path, yt = G'^(w)o,t 
is a version of the strong solution of Iji4-^ on the probability space 
[W^,B{WP),P'^). The results remain true if all /j are linear in the 
space variables, with bounded derivatives in t. 
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2. We have 

P"'{r(e)w G W^' : V£ > 0} = 1 
where Y[e)^s,t = (1, V^w^u ^^s,t)- 

3. Let be the distribution of {Y{£)"^s,t)o<s<t<i' ^ probability measure 
on (W^,i3(WP)). Then {P^ : e > 0} possesses the large deviation 
principle with respect to the topology induced by the p-variation metric, 
with rate function 

1 

0(w) = - \w{t)\^dt, 

if w E W°° such that its first level path w G Hi{[0, 1]; R^), otherwise 
0(w) = oo. 

The first item in the theorem is called the universal limit theorem of 
Lyons', the second item says the Brownian motion may be lifted to geometric 
rough paths, and the last item is Schilder's large deviation principle in the 
]9- variation metric, proved in Ledoux-Qian-Zhang p6j . 

4.2 Several elementary facts 

In this part we present some important facts about the relationship between 
multiple Wiener-Ito integrals and solutions of stochastic differential equations 
of Stratonovich type. To this end we need to introduce more notations. 

If / G L^(_R"), then = {Jn{f)t} is the process of n-th multiple 

Wiener-Ito integrals where 

Jn{f)t= f{ti,---,tn)dWt^---dWt„ 

Jo<ti<-<t„<t 

which is a martingale for n > 1. 

It occurs in the computations below some "partial" multiple Wiener-Ito 
integrals which are no-longer martingales. Here is a typical example. 

If / is a function of variables (ti, ■ ■ ■ ,tn), then for 1 < k < n we use 
f-k{'] t) to denote the function of (ti, ■ ■ ■ , tk): 

f;k{-] t) : {ti, ■ ■ ■ ,tk) ^ f{ti, ■ ■ ■ ,tk,t, - ■ ■ ,t). 
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Then /.„ = /. The following stochastic process 

JkU-A'] t))t = / /(ti, ■ ■ ■ ,tk,t,- ■ ■ , t)dwt^ ■ ■ ■ dwt„ 

Jo<ti<-<tk<t 

is well-defined, for example, if / is differentiable in all variables. 

In what follows, we always consider a function / of n variables in the 
order from left to right (i.e. we use the standard coordinate system in i?"), 
and Vj/ denotes the partial derivative in the j-th coordinate, i.e. -^f . 



dJn{f)t = Jn-1 (/;n-l(-; t)\ O dWt - -Jn-2{{V nf) ,n-2 (S i))^^ (4-3) 



Lemma 4.2 If f{ti, ■ ■ ■ ,tn) is smooth with bounded derivatives, then 

1 
2 

where odwt denotes the Stratonovich differential. 
Proof. By definition 



Jn{f)t = / Jn-1 (/;n-l(-; s))^ dw^ ■ 

Jo 



To simplify our proof, let Zt = Jn-i (/;n-i(-; ^))t so that Jn{f)t = Jo ZgdiVg. 



Therefore 



Jn{f)t = y ZsOdWs- -{Z, w)t 



and we aim to compute the bracket process (Z, w)t. To this end, we begin 
with the case that 

/n(^l, • • • , tn-1-, tn) = 5'n-l(^l, • • • , tn-l)g{'tn)- 

Then, according to integration by parts 

Zt = g{t)Jn-i{gn-i)t 

= / g'{s)Jn-i{gn-i)sds+ / g'{s)dJn-i{gn-i)s 
Jo Jo 

g'{s)Jn-i{gn-i)sds 







+ / g'{s)Jn-2{gn-l;n-2{-;s))sdWs 

Jo 
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which follows that 



{Z,w)t = / 9\s)Jn-2{9n-l;n-2{-]s))sds 




Jn-2{9n-\;n-2 




Jo 

It is immediate that this equality holds for general /, and thus proves the 
lemma. ■ 

Lemma 4.3 Let fnih, ■ ■ ■ ,tn) be a smooth symmetric function, let 1 < k < 
n, and consider ltd 's multiple integral 



Hi 




Then 



n 



dHi 



t 



j=k+l 




(4.4) 



Proof. Let us consider the case that 



fn{tl, • • • ,tk, tk+l, ■ • ■ ,tn) — gk{tl, ■ • • , tk)gk+l{tk+l) ■ • ■ gn{tn) 



SO that 



fn{ti, ■ ■ ■ ,tk,t, • ■ • ,t) — gk{ti, ■ ■ ■ , tk)g{t) 



with 



g{t) ^ gk+i{t)---gn{t)- 
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Then, by integration by parts. 



9\t)Jk{gk)tdt + g{t)dJk{gk)t 
g'{t)Jk{gk)tdt + g{t)Jk~i {gk;k-i{-\ t))^ o dwt 

~g{t)Jk-2{{^kgk).k-2 



n 



Jk{{^jfn).k + Jk-l {fn;k-l{-\ t))^ O rfwj 



j=k+l 



--■h-2{iykfn),k-2i-^t))dt 



which proves the lemma. ■ 

4.3 Stochastic differential equations 

Let ^ = Yl!l=i Jn{fn)i G -^^^(Wq, jFi, P"') (but in the following computations, 
we assume for simplicity that d = 1) for some natural number N and smooth 
functions /„ on [0, 1]" with bounded derivatives, and 



The aim of this section is to construct a continuous function on 
(W^, BiW^), P^) (where the space of geometric rough paths is endowed 
with the p-variation distance), such that F^(r(e)w) = y^(w) almost surely. 

To this end, we demonstrate that is a part of the solution of a 
Stratonovich type stochastic differential equation, at least for good functions 



N 





(4.5) 



n=l 



fn- 



According to (14. 4p 



N 




n=l 




n=l 
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and 



dJn-l {fn;n-l{-',t))f — V„ [Jn-l{fn;n-l{-', t))t\ dt 

+ Jn-2 {fn;n-2{-;t))^0 dWt 
1 
2 



--J,_,{{Vn-lfn).,n.s{-;t))dt. (4.6) 



where 



Vn [Jn-l{fn;n-l{'','t))t] 
d f 

= / fn{tl, • • • , ^n-l, tn)dWt^ ■ ■ ■ dWt„_-,. 

Unfortunately it docs not lead to a closed system of stochastic differential 
equations of Stratonovich type. Therefore we consider a special case in which 
each fn is a hnear combination of functions of product form. We need some 
more notations. 

For e G (0, 1), n G N, and {g; P-, - ■ ■ , /"} a family of smooth functions 
on [0, 1] with bounded derivatives, then we define 

Zn,{9-J\-,n ^ ^t^(^) I . ..fnit^)dnj,^ . ..dwt^ (4.7) 

Jo<ti<-<t„<t 

and 

- I ^n,{fl;/i,-,/"} 



Therefore 



Z^'^'-'f''-'^''^ = £t /■ f\h)---r{Qdw,,---dw,^, 

Jo<ti<-<t„<t 

and contains just two identical copies of '■' \ We use 

the convention that = 1 and = for n< 0. 

Lemma 4.4 T/ie stochastic process ' ' satisfies the following re- 

cursion equations 

--££;22Xr''^^^""^"^^''-'^""^rfi (4.8) 
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where Eij is the 2x2 matrices with 1 at {i,j) entry and other entries zero. 

Proof. It follows from (14.41) that Z^"^^"^ "'""^ ^ satisfies the following 
stochastic differential equation 

rfZr'^^^^^'- = Vizr^'^^^"^^^'- o dw, + e'^g'mT'-"'^^'" '^"^dt 

_i^"-2,{9r-lr;/^■■■,/"-"}^^ 

2 * 

and 

^^n,{l;/^...,r} ^ ^^n-i,{r;/\...,r-} ^ _ i^^n-ur-^r-j\-,r-^}^^ 

which is equivalent to (14.81) . ■ 
Now let us consider 

where ^ = Yln=i Jn{fn)i e ^^(W^, J^i, P'") with each integrand /„(ti, ■ ■ ■ , ^n) 
has a product form, say 

/„(tl,---,tO= E Cr^''fnitl)---fn{tn) (4.9) 
ii,-- ,in=l 

where C^^'"^" are constants, A/^^ is a natural number, and all fl' are smooth 
functions with bounded derivatives. 
In this case 

N Nn n 

n=lii,..,j„=l ^0<ti<-<t„<t 

(4.10) 

We are going to show that is part of the solution to a Stratonovich type 
stochastic differential equation. More precisely, we are going to show that 

is the unique strong solution to a system of stochastic differential equations 
of Stratonovich type, where the general term is given by 

^nAj ^ ^^M^r.,,.rJi\-f-'} for A; = 1, ■ • • , n - 1, J = 0, ■ ■ ■ , - 1 
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and gn,k,j = riiL/ '■ The projection to the first component in fl4.11l) . i.e. 

{y,y, (a;"''''^)o<i<fc<n,i<n<7v) y 

will be denoted by tti. 

Proposition 4.5 Let Zt = ("K/, Yf) where {Yf)t<i be given by equation 
dT^Tiy , and /ei 

\ ^ / 0<j<k<n,l<n<Ny 

Then the stochastic process Xt is the unique strong solution to the following 
system of Stratonovich stochastic differential equations 

N TV 

dZt = v^Yl Yl cr-'''{Ei2+E22)x:^'^'odwt 

'^=1 il.--- ,jn = l 

^ N N 

-2"E E Cr'-{Eu + E,,)X:''''dt, (4.12) 
c^^r''" = e"-^g'^/t)E,,X:'''^dt 

+ i522xr'''"''') ° (4.13) 

/or 0<j<k<n<N, X"'"'"' = 1 anc? X"'^'-' = /or any k > n. The 
system (^TT^UT^ can be written into a compact form 

dXt = ^A{Xt)odwt + eB{Xt)dt 

N-l 

+ J2e~'Ck{t,Xt)dt (4.14) 

k=l 

where all A, B and Ck defined by lj^4-lSl^i4^TS\ ) are linear in the space variable, 
with bounded derivatives in t. odwt denotes the Stratonovich differential. 

Proof. By definition 

N N 

yt=Y. cr^-z^^^^--^''\ (4.15) 
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Instead of considering we take two copies of the same equation, i.e. we 
consider 



SO that 



N N 
n=l ji,--- ,jn=l 



N N 



dZI = Y^ Yl C'^''"'"c^^r • (4.16) 
Using (14. 8 p we obtain 

--e {Ei2 + E22) X""^'^^" Mt(4.17) 

so that 

AT iV 

1 ^ ^ ■ i ■ J 

--e E E ^^12 + E22) xr''^^""^""'''^"''"' '^""'74^18) 

Now repeating the use of Lemma 14.41 we obtain 

^^n-l,{/^";/n^■■•,/^-^ 

dt ^ 2 * 

2 * 

+ ( EisXf"^'^-^"""''^"''"''^"""'^ + ^22Xf"^'^^'"^"""''^"'''"'^"""'M O dWt, 
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n-2,{/^-l;/^l,...,/^-2} 



dt 



2 * 

1 ^ /I r f^n — 3 f^n — 2 — 1 rJl r^ri — 4i 



o (ity^ 



and 



n-2,{/,i"/^-i;/^i,..,/^~2} 



dt * 

+v/e ^12X4 '^'^ ' ^ + ^22X4 ^" ^" ' Mo dWt 

1 *i /I S f^ri — Z — 2 r31 r3n — A\ 

2 

1 4 / Jn-S r3n-2 Jn-1 rjn . rJl 

££'22^ ^ dt 

and so on. The general term appearing in this system is 

_ 7 J rJl f^n—k \ 

where S'n.fcj- = lit/ ''^ '^uns through 1 up to N. We have thus completed 
the proof. ■ 



4.4 Ito-Lyons mappings 

Let ^ = X]n=i Jn{fn)i be given by (14. 9p and use the notations in the pre- 
vious sub-section. For each 5 G (0, 1), we consider the following differential 
equation fl4.14p 

dXt = A{Xt)odwt + 5B{Xt)dt 

N-l 

+ Y,^~'Ck{t,Xt)dt (4.19) 
fc=i 
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on the rough path space W^, where A, B and are given in Proposition 
14. 5[ According to Theorem I4.H the differential equation fl4.19p defines an 
Ito-Lyons mapping which is continuous with respect to the p-variation 
topology. The projection of to the first component Y in Proposition 14.51 
of the first level path (the projection is denoted by tti) is then denoted by . 

That is F\^N)t = tti I G^ (w)q ^ . We also consider the differential equation 

dXt = A{Xt) o dwt, Xo = 

whose corresponding Ito-Lyons mappings are denoted by G and F (i.e. G = 
and F = F^). 

Let us list some properties about F^ . 

Recall that is the space of all rough paths inR'^ endowed with the 
p- variation metric, fi is the distribution of Brownian motion with its area pro- 
cess, (Wq,JFi,P"') is the Wiener space, and Wq equipped with the uniform 
norm. The natural projection tt : Wq which takes w = (l,w^j,Wg^) 

to its first level path w : t G [0, 1] ^ ^ is continuous. 

Proposition 4.6 1) For each 5 G (0,1), F^ : Wq is continuous, 

where is equipped with the p-variation metric, Wq endowed with the 
uniform norm. 

2) If he H^{[0, 1]; R^), then F" {r{6)h)^ = 7Ti{xt) where (xt) is the unique 
solution to the ordinary differential equation 

N-l 

dxt = y/eA{xt)dht + eB{xt)dt + ^ e^Gkit, xt)dt. (4.20) 

k=l 

3) For every e G (0, 1), we have 

P"" {w : F' (r(e)w), = F/(w;) Vt G [0, 1]} = 1. (4.21) 

where w = j is the first level path of w = (1, w^, w^) G W'''. 

4) We have 

P"" l^w: F{T{e)w)^ = Sl{w) VtG[0,l]} = l. (4.22) 

where 

St = X] / •^"'^^1' ■ ■ ■ ' ° ■ ■ ■ ° 



37 



and, if he W°° such that t ^ ht = G i^o ([0, 1]; R"^), then 



F (h) = V / U{ti, ■■■ , tn)h{ti) ■ ■ ■ h{tn)dt, 
n=l 



■dtr^ 



Proof. The first claim and second claim follow from Lyons' continuity 
theorem, Theorem 14.11 3) follows from Proposition 14.51 and Theorem 14.11 
The last item comes from the fact that the terms involving vector fields B 
and Ck come from the correction terms from Ito integrals to Stratonovich's 
integrals, therefor if we started with the multiple Stratonovich's integrals 
(or ordinary integrals), all these terms disappeared. We thus completed the 
proof. ■ 

Proposition 4.7 Let 6 > 0. Consider the solutions {xl)t>o and (yf)t>o be 
the solutions to Stratonovich differential equations 

dyt = y/eA{yt)odwt + eB{yt)dt 

+ ^e^Ckit,yt)dt , yo = 

k=l 

and 

dxt = y^A{xt) o dwt, Xo = 

on (Wq, J-'i, P^) , respectively, where A,B and Ck are given in Proposition 
Then 



limelogP"' < sup |vri(xf) - i^iiyDl > 5} = -oo. 
[te[o,i] J 

Proof. According to the definition our system 

T^iiVt) = 5Z / •^"(^1' ■ ■ ■ ' ^n)dwt^ ■ ■ ■ dwt„ 

n=l 



and 

N 



M4) 



y^^^ fnih,- ■ ■ ,tn) O dWt^- ■ ■ O dWt 

n=l 
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so that (for example, by applying Hu-Meyer formula [2^, see also [21]) 

limelogP'" < sup \tti{xI) — vri(|/j)| > 6> = —oo. 
[te[o,i] J 

■ 

Corollary 4.8 Let he the law of {'7Ti{yf))t£[o,i]- Then the family {vg : e G 
(0, 1)} satisfies the large deviation principle with the rate function given by 

I'j^iw) = inf {1(h) -.heHs.t. ^h) = w} (4.23) 
where 1(h) = \ \h(t)\'^dt for h e H, and 

^h)t = J2 / fn{tu ■ ■ ■ , tn)h(ti) ■ ■ ■ h(tn)dti ■ ■ ■ dt^. (4.24) 

n=l 

Proof. Let G be the Ito-Lyons mapping determined by the differential 
equation 

dxt = A(xt) o dwt, Xq = 
on WP. Then G : ^ is continuous. Define F : ^ W[J by 

F(w), = TTi (g(w);,) 
which is continuous, and moreover F(r(e)w)t = 7ri(xf), and 

F(\l)t = V / /„(ti, ■ ■ ■ , t^)h(ti) ■ ■ ■ h(tn)dti ■■■dt^ 

n=l 

for any h G W°° such that h = 7Ti(h) G H^([0,1]; R'^). It follows from 
Theorem 14. 11 the distributions of ('7ri(xf)) satisfy the large deviation principle 
with the good rate function defined by fl4.23p . Now, according to Theorem 
4.2.13 on page 130, |i7| and Proposition 14. 7[ one may conclude that {z/^ : e G 
(0, 1)} satisfies the large deviation principle with the same rate function. The 
proof is complete. ■ 
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5 Large deviations for martingales 

In this section we extend the large deviation principle to a general square- 
integrable martingale on (Wq, JF^, JF^, P"'). 

Let ^ G L^(Wq, J-'i, P^) with mean zero, whose Wiener-Ito chaos decom- 
position ^ = J2kLi -^fc(/fc)i)Where fk G L'^[0, l]^ for every k and 

oo ^ 
n=l 

Let Yt = P'^i^lTt) = Er=i Mfkh and for each e G (0, 1) 

oo 

Yt = P-io^V^Yt = 5^^^ yt G [0, 1]. 

k=l 

Let z/e be the law of (F/)ig[o,i] which is a probability measure on (Wq, jFj^). 
In this section, we prove the main result, Theorem ll.il that is, we show that 
{u^ : £ G (0, 1)} satisfies the large deviation principle on (Wq, || ■ ||). 

The idea, as we have mentioned, is to construct a sequence of exponential 
approximations to (F/)tg[o,i]. For each natural number n, there is a natural 
number iV„ such that 



^ T7ll/A-llr2rni,fc < 



k=Nn+l 



1 



For each k = 1, ■ ■ ■ , Nn, choose a symmetric function fj} on [0, l]'^ which has 
a product form 



jl,-- ,jn = l 



where C^^^'"'*' are constants, Nn^k is a and all are smooth functions on 
[0, 1] with bounded derivatives, such that 



Wh - fkWlno,!]^ < forA; = l,---,iV, 

Define U = T.kliWk)i- Then 



fc=l 
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so that 

Nn . OO 



ll^~^n||2 = '^l^llfk - fk\\L'2[0,l]k + ^ ^ll/fcllL2[0,l]'= 
k=l ' k=N„+l 

1 



which imphes that ^„ — > ^ in L^(Wq, JF^, P^). We of course can choose A''^ 
increasing in n. It is obvious that for each k, fj^^fk^^sn^ oo. 
Let Y{n)t = £^'^(^^1^0 and 



k=l 

Let denote the distribution of (Y{n)f). 

Let An, Bni Cj^n bc the corresponding vector fields determined in Propo- 
sition S2] for each in place of ^. Then Y{n)l = Tri{y^'^), where {yt'^)t>o is 
the unique strong solution to 

dyt = y/eAn{yt) o dwt + eBniyt)dt 

+ Y^e-^Cj,n{t,yt)dt , 1/0 = (5.1) 
i=i 

on (Wq, jFi, P""). Let X{n)l = 7ri(x^'") where (Xf'")4>o is the unique strong 
solution to 

dxt = ^/eAn{xt) o dwt, Xq = 0. (5.2) 

Lemma 5.1 Both families {(^(^)f)t<i : £ G (0, l)}n=i,2,--- and {(X(n)f)^<-^ : 
e G (0, l)}„=i,2,--- converge to {{Y^ )^^^ : e G (0,1)} exponentially. That is, 
for each 6 > 0, 



lim lime log P"' <^ sup \Y{n)l - F/| > 5 ^ = -oo (5.3) 



and 



lim lime log P"' <^ sup \X{n)l - r/| >6} = -oo. (5.4) 
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Proof. By Proposition 14. 7[ we only need to show fl5.3l) . By Lemma [3.3[ 
for any 5 > 0, e G (0, 1) we have 



P"" { sup \Y{n)l - Y^'\>5\<{l + e 



ll^n - ^||2 



1+i 



SO that 



£logP"'<^sup|r(n)^-F/| > 6 
< e(l + -\\og{l + e)-s(l + -^]\og5 



+£(l + -)log||e„-el|2 



e 



Hence 



and therefore 



hmelogP"" <^ sup \Y{n)l - F/| > 5 

< iog|ie„-eii2-iog5 



hm hmelogP*" < sup \Y{n)l — F/l > 6> = — oo 



To prove the large deviation principle for the limit distributions of (y^'^)^<i, 
one would attempt to apply an extended contraction principle (for example, 
Theorem 4.2.23, page 133, [7]) to the exponential approximations X{n)'^. 
Since F„ (w)^ = 7ri(G'„(r(£:)w)J J is a version of X(n)f, which approxi- 
mate : e G (0,1)} exponentially, where Gn is the Ito-Lyons mapping 
on (W^, dp) associated with 

dxt = An{xt) o dwt, xo = 0. (5.5) 

The mapping F„ : (W^, dp) (Wq, || ■ ||) is continuous, and the distribution 
family of {{X{n)f)t<i : e G (0, 1)} satisfies the large deviation principle with 
rate function given by 

r^{w) = inf {I{h) \ heH such that F„(h) = w} 
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where 



F„(h)i = V / fj:{h,--- ,h)hit^)---h{tk)dh---dtk yheH. 

k=i 

Therefore, if F„ were convergent uniformly (in p-variation distance) on any 
level set {I{h) < L} uniformly, one could conclude the proof of Theorem ll.il 

However, unfortunately, as a matter of fact, Fn does not converge uni- 
formly on {I{h) < L} in p-variation metric dp in general, which would require 
a control on the derivatives of fk- Thus, we can not prove our main theorem 
by simply appealing to a (generalized) contraction principle. 

This is the reason why we develop the continuity theorem for large devi- 
ations. Theorem 12.51 

Proof of Theorem 11.11 Let E = Wq with the uniform norm, H = 
H^{[0,1]; R^) with the Sobolev norm || ■ \ \h'^- Then J is a good rate func- 
tion with the effective set {/ < 00} = H. For each consider the following 
mappings Fn : H ^ E, where 

Fn{h)t = J2 / f^ih,--- ,tk)h{h)---h{tk)dh---dh te[0,l] 

k=l Jo<ti<-<tk<t 

andF : H ^ Ehy 

00 „ 

F('^)t = J2 fkitu--- ,tk)hih)---h{tk)dt,---dtk tG[0,l]. 

Then, according to Proposition 12.41 F„, F are rate-function mappings. It is 
easy to see that Fn —>■ F uniformly on any level set = {h : I{h) < L} 
with respect to the uniform norm, thus, according to Corollary 14. 8^ for each 
n, both the laws of {X{ny : e G (0, 1)} and {Y{nY : e G (0, 1)} satisfy the 
large deviation principle on (Wq, || ■ ||) with the common rate function 

I'nis) = inf {1(h) -.heH s.t. Fn{h) = s} 

and {Y{ny : e G (0,1)} goes to {F^ : e G (0,1)} exponentially, therefore, 
by Theorem 12. 5[ {V^ : £ G (0, 1)} satisfies the large deviation principle with 
rate function 

r{s) = inf {1(h) : h e H s.t. F{h) = s} . 
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